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Abstract. Given an /.-function, one of the most important questions concerns its vanishing at 
the central point; for example, the Birch and Swinnerton-Dyer conjecture states that the order of 
vanishing there of an elliptic curve /.-function equals the rank of the Mordell-Weil group. The 
Katz and Sarnak Density Conjecture states that this and other behavior is well-modeled by random 
matrix ensembles. This correspondence is known for many families when the test functions are 
suitably restricted. For appropriate choices, we obtain bounds on the average order of vanishing 
at the central point in families. In this note we report on progress in determining the optimal test 
functions for the various classical compact groups for different support restrictions, and discuss how 
this relates to improved rank bounds. 


1. Introduction 


1.1. Background. While the importance of random matrices in mathematics and related disci¬ 
plines had been noticed at least as early as Wishart’s work liWisI in the late 1920s, for us in number 
theory the story begins with the connections observed by Montgomery and Dyson UMonll in the 
1970s. Montgomery was studying the pair-correlation of zeros of the Riemann zeta function, and 
the behavior was identical to that of certain random matrix ensembles which had been extensively 
studied due to their applicability in nuclear physics. Briefly, characteristic polynomials (and their 
eigenvalues) of the classical compact groups have been observed to model well L-functions (and 
their critical zeros). While we will concentrate on low-lying zeros, i.e., zeros near the central point, 
in families of L-functions, there is an extensive literature on other statistics, including 77-level cor¬ 
relations HHejtlMonllRSl , spacings HOdlllOdH , and moments nCFKRSL See llFMlIHill for a brief 
history of the subject and IlConl IForl IKaSalL IKaSa2L IKeSnll IKeSn2L IKeSn31 IMehl ICTFBl U) 
for some articles and textbooks on the connections. 

In many of the earlier works on the correspondences between the two subjects, the statistics 
studied were insensitive to the behavior of finitely many zeros. This led to the introduction of a 
new statistic, the 77,-level density, as often the zeros near the central point are related to important 
arithmetic quantities, with the Birch and Swinnerton-Dyer conjecture (stating that the order of 
vanishing of the L-function at the central point equals the rank of the Mordell-Weil group of 
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rational solutions) the most famous example. In this paper we concentrate on the 1-level density, 
which we define in detail in ^1.21 We report on recent results from the first author’s honors thesis at 
Williams College, supervised by the second author, where building on methods introduced in HILSM 
optimal test functions are constructed for various statistics for different support ranges. The main 
application of these theorems are improved estimates on the average vanishing at the central point 
for families of L-functions. In addition to being of general interest, such results have important 
applications (for example, in l!lS1 good estimates here are connected to the Landau-Siegel zero 
question). 

In the arguments below we concentrate on the limiting behavior. An important topic for fu¬ 
ture research is to include lower order terms and determine the optimal test functions for various 
regimes where the limiting behavior has not yet been reached. These regimes are quite important 
as they are the ones that can be investigated numerically, and often the data gathered is at odds with 
the limiting predictions as the rate of convergence is abysmally slow. The prime example is that of 
whether or not their is excess rank in families of elliptic curves (see HBMSW 'I for a nice summary 
of data and conjectures); while earlier investigations indicated that such bias might persist, later 
studies ITWI went far enough to see the average rank drop, and new random matrix models have 
been introduced that have the correct limiting behavior and successfully model the observed be¬ 
havior for small conductors HDHKMSll [DHKMS2II . There are now many results on lower order 
terms in families, such as ifHKSl 1MMRWL lMfl2l lYoIl . and the hope is that the methods of this 
paper can be extended to include these to refine estimates for finite conductors. 

On a personal note, the second author investigated questions on rates of convergence with Ram 
Murty in HMMfl (explicitly, proving effective bounds on families of elliptic curves modulo p (for p 
prime tending to infinity) obeying the Sato-Tate Law). It is a pleasure to dedicate this work to him 
on the occasion of his 60 th birthday, and we hope to report on extending our result to lower order 
terms before his next big celebration! 

1.2. n-Level Density. As alluded to above, the behavior of zeros far from the central point ex¬ 
hibit a remarkable universality across L-functions. Unfortunately it is significantly harder to study 
one L-function’s zeros near the central point. The reason is that there are only a few normalized 
zeros near the central point, and there is thus no possibility of averaging if we restrict ourselves 
to just one object (in the extreme case of whether or not the L-function vanishes, we just have a 
‘yes-no’ question). To make progress, we instead study a family of L-functions. The Katz-Sarnak 
philosophy UKaSall [KaSa2 1 states that the behavior of a family of L-functions should be well- 
modeled by a corresponding classical compact groups, with the conductor in the family tending to 
infinity playing the same roll as the growing matrix size; for alternative approaches to modeling 
the behavior of zeros, see IfCFZnicmitGHKl 

We briefly describe the main statistic studied, the n-level density (though we will report on 
progress on the 1-level only here, see fFj for additional results). For ease of exposition we assume 
the Generalized Riemann Hypothesis, so given an L(s,/) all the zeros are of the form 1/2 + 
ijj.j with t j.j real. Our statistic makes sense more generally, but we lose the interpretation of 
ordered zeros and connections with nuclear physics; the main use of GRH is to extend the support 
calculation for many of the number theory computations. We assume the reader is familiar with 
n-level densities; for more detail on these statistics see the seminal work by Iwaniec, Luo and 
Samak IIILSil . who introduced them, or HAAILMZll for an expanded discussion (which formed the 
basis of the quick summary below). 
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Let <f)(x) = r[j=i0i( x i) where each </)j is an even Schwartz function such that the Fourier 
transforms 

rCO 

( 1 . 1 ) $j(y) := (j)j(x)e~ 2mxy dx 

J—oo 

are compactly supported. The n-levcl density for / with test function & is 
C 1 - 2 ) 0i (L njvJ ) ■ ■ ■ <j> n (L njn .j), 

3 £ 7 ^ jm 


where Lj is a scaling parameter which is frequently related to the conductor. Given a family 
T = u n T n of L-functions with conductors tending to infinity, the n-levcl density D„ (J~. 0, w) 
with test function <j) and non-negative weight function w is defined by 


(1.3) D n (T,(i),w) 


lim hf e r N W(f) D n(f, <j>) 

Ar ™ h fe r N w(f) 


Frequently one chooses Tn to be either all forms with conductor equal to N, or conductor at most 
N. 

Unlike the n-level correlations of a family, which have a universal limit as the height of the 
zero tends to infinity, Katz and Sarnak HKaSalIfKaSa2 | proved that the n-level density is different 
for each classical compact group. They were able to obtain closed form determinant expansions; 
while these expressions can be hard to use for n ^ 2 (see mi for a discussion on the benefits of 
an alternative), they are very easy to use for the 1-level. 

Let K(y) := and K e (x, y ) := K(x — y) + eK(x + y) for e = 0, +1. If G N is the family 
of N x N unitary, symplectic or orthogonal families (split or not split by sign), the n-level density 
for the eigenvalues converges as N —> go to 


(1.4) 

where 


ro 0 rco 

• ■ ■ <j)(xi ,..., x n )Wn, G (x 1 ,..., x n )dx 1 • • • dx n 

J — 00 J — 00 

r* 00 rCO ^ 

Hvi, • • •, y n )W ntG {yi, • • •, y n )dyi ■ ■ • dy n , 

J—OO 


W m £o( even) (^) 
^^mjSOfodd) (*^) 

m,0 (^) 

W m9 \j(x) 
(1.5) W m) S p(x) 


det ( K\ (^2 5 

m 

det(A"_i(xj, Xj))ij^ m + J] S( x k) det(iT_ 1 (xj, Xj)) itj¥lk 

k =1 

\W m ,SO(even) (x) + \w m ,SO(odd) (x) 

det (K 0 (xi,Xj)) i,j^m 
det ( K—\ (^25 3 'j')')i 1 j^rrf 


While these densities are all different, for the 1-level density with test functions whose Fourier 
transforms are supported in (—1,1), the three orthogonal flavors cannot be distinguished from each 
other in this regime, though they can be distinguished from the unitary and symplectic. 
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In many of the calculations it is convenient to shift to the Fourier transform side. Letting 


r 

l 

if \u\ 

< 1 

1/2 

if w 

= 1 

0 

V 

if |tt| 

> 1 


(1.6) T}(u) = < 

and r) 0 denote the standard Dirac Delta functional, for the 1-level densities we have 


,—_ 


W l,SO(even) (w) 

= 5 0 (u) + -77 (u) 

__ 

, , 1 

W h0 (u) 

= <5 0 (w) + - 

W l,SO(odd) (tt) 

= <S 0 (u) - u ) + 1 

__ 

1 

WW«) 

= S 0 (u) ^ 

W l:U (u) 

= 5o(w), 


Note that the first three densities agree for |w| < 1 and split (i.e., become distinguishable) for 
|u| ^ 1; alternatively, one could use the 2-level density which suffices to distinguish all candidates 
for arbitrarily small support (see HMil21 ). 

As stated earlier, the Katz-Sarnak Density Conjecture is that the behavior of zeros near the 
central point in a family of L-functions (as the conductors tend to infinity) agrees with the behavior 
of eigenvalues near 1 of a classical compact group (as the matrix size tends to infinity). There is 
now an extensive body of work supporting this for numerous families and various levels of support, 
including Dirichlet characters, elliptic curves, cuspidal newforms, symmetric powers of GL(2) L- 
functions, and certain families of GL(4) and GL(6) L-functions; see for example HDM1LIDM21 
mR-GRllFlMllFH[Gaol[GullHMllHRlinjgllKaSa21lLMllMiIIllMilPeL[nSTUOg2llRRllRollRubl 

lYallYo2l This correspondence between zeros and eigenvalues allows us, at least conjecturally, 
to assign a definite symmetry type to each family of L-functions (see fDM2 . ShTel for more on 
identifying the symmetry type of a family). 


1.3. Main Result. One of the most important applications of the //-level density is to estimate 
the average order of vanishing of L-functions L(s, /) at the central point in a family; this is the 
analytic rank, and is denoted Rank(/). While in some families it is natural to use slowly varying 
weights (such as the Petersson weights for families of holomorphic cusp forms), with additional 
work these weights can often be removed (see HILSfl ). 

If we assume GRH for our family of L-functions, then all critical zeros have real part 1/2. 
Further, if our test function </> is non-negative, then in the 1-level density we obtain an upper bound 
for the average rank by removing the contribution from all zeros not at the central point: 

y fF r u/(/)Rank(/W0) r 00 f 00 ~ ~ 

(1.8) lim - m < <t>(x)W liG (x)dx = </>(y)W 1>G (y)dy. 

N ^°° L/eJ^U) J-oo J-oo 

In practice, we can only establish the n- level density for test functions with restricted support. 
On the number theory side, the goal is to verify the correspondence for as large of support as 
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possible, as we can then use (11.81) to bound the rank: 


(1.9) 


lim AveRankfJ-'y) ^ 

N—>oo v 7 


‘\t, - i(n)W, u ly)rly 

m 


Note that instead of trying to increase the support for the 1-level density we could shift to studying 
higher level densities. While this gives us better bounds for high vanishing at the central point, the 
probability of vanishing to order r or higher decays like c n /r n , unfortunately c n grows with n and 
the result is worse than the bounds from the 1-level density for small r (which are the ones we care 
about most); see IHSH- 

Using the Paley-Wiener theorem to note the admissible test functions are the modulus squared 
of an entire function of exponential type 1 (or its Fourier transform as a convolution), Plancherel’s 
theorem to convert to an equivalent minimization problem, and some Fredholm theory, in Appen¬ 
dix A of HILS1 the optimal test functions are computed for the 1-level density for the classical 
compact groups under the assumption that the support of the Fourier transform is contained in 
[—2, 2]. Our main result is to generalize these computations to larger support and higher n. 


THEOREM 1.1. Let 0 be an even Schwartz test function such that supp(0) a [—2a, 2a], with 
a = s/2. Then for 2 < s < 8 the test function which minimizes the right hand side of (11.8b is given 
by 0 = gg )S * gg/ s . Here * represents convolution, gg/( x ) = ggf(-x), and gg )S is given by 


( 1 . 10 ) 


and 

( 1 . 11 ) 

forQ 


( 1 . 12 ) 


9SO(even),s { x ) 


ci,g, s cos (4 


4o(even) ,s 


COS 


c i,g ,3 

V2 


N 

2 

sin 


N+ 1 ) 


AT ) + C TG,- 


|x| ^ s/2 — 1 
s/2 — 1 ^ |x| < 2 — s/2 
2 — s/2 < |x| < s/2 
\x\ Js s/2, 


goA x ) = 

O, and 


9G,s{ x ) = 


( 1 

M 

I l+s/2 

l 0 

M 


^g,s < 


o ,g,s cos (4 


cos | Jfl + ^ 




^ sin (4r) + 


|x| sc s/2 — 1 
s/2 — 1 ^ |x| ^ 2 — s/2 
2 — s/2 < \x\ < s/2 
|x| ^ s/2 


for Q = SO(odd) or Sp. Here, the c t ,g and A g are easily explicitly computed, and are given later 

in dlU), (EHl (ETH) . (I5T21) . and (EOT) . 


For the rest of the paper, we let a = s/ 2. Unless otherwise 
stated, 1 < a < 1.5, corresponding to the range for s. 

This notation is slightly at odds with other works in the 
literature, where the support of 0 is contained in (—a, a) 
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and for us it is (—2a,2a); we have elected to proceed this 
way as the natural object is g, and the support of </> is 
double that of g. 


Moreover, the optimal function gg s , along with its coefficients c,g s and its scaling factor Xg tS , 
all depend on s. As this will be clear from equations (15.91) to (15.131) . to simplify the notation we 
often omit the subscript s or a, as these are fixed in the analysis. 

To help illustrate the main theorem, we include plots of the optimal g for the groups SO(even), 
SO(odd), and Sp below in Figure Q] and the plots for the corresponding optimal f in Figured we 
do not include the optimal plots for the mixed orthogonal case, as the resulting g is constant. 




Figure 1. Plots of the optimal gg with a = 1.2 (and thus s = 2.4). Left: Op¬ 
timal SO(even) function. Middle: Optimal Sp function. Right: Optimal SO(odd) 
function. 



Figure 2. Plots of the optimal f with a = 1.2 (and thus s = 2.4). Left: Op¬ 
timal SO(even) function. Middle: Optimal Sp function. Right: Optimal SO(odd) 
function. 

As an immediate corollary we obtain the following bounds on the average rank. We isolate 
these upper bounds below. The record for largest support for the 1-level density are families 
of cuspidal newforms IIILSl and Dirichlet L-functions HFiMl (though see also HAMH for Maass 
forms), where we can take a < 2. It is possible to obtain better bounds on vanishing by using the 
2 or higher level densities, though as remarked above in practice the reduced support means these 
results are not better than the 1-level for extra vanishing at the central point but do improve as we 
ask for more and more vanishing (see llHMIl f. 

COROLLARY 1.2. Let T be a family of L-functions such that, in the limit as the conductors 
tend to infinity, the 1-level density is known to agree with the scaling limit of unitary, symplectic or 
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orthogonal matrices. Then for every e > 0 in the limit the average rank is bounded above by 


' 4V2sin(i(3-2 CT ))+2(CT-l) sin(i(-2<T+w+3))+sin(i(2cr+7r-3)) (V2(cr+1) tan(^)+2) 
8\/2 sin( ^ (3—2er)) +8((J— 1) sin( ^ (—2er+7r+3) ) +4V2(T sin ( ^ (2a I tt 3) j tan f ffj 
'2(<r—1) sin(i(2o-+7T—3))—4V2sin( j(3—2cr))+sin(|(—2CT+7T+3)) [\/2((7—3) tan (p^r)+ 2 ) 


(1.13) e + < 


8 (< t — 1) sin( j(2cr+7r—3))+8-\/2 sin( j(3—2<j)) —4-\/2(<j—2) sin( j(—2(7+7r+3)) tan[2__L] 
6(<t— 1) sin(i(2<T+7r—3))+4V2sin(i(3—2o-))+sin(i(—2(T+7r+3)) (^v / 2(5—3cr) tan ( 2 ^i)+2') 
8(<r—1) sin(j(2(7+7r—3))+8\/2sin( j(3—2cr))—4\/2((7—2) sin( j( —2(7+77+3)) tanf 2 ^!-] 

J- + I 

2(7 ^ 2 


g = SO(even) 
G = Sp 
g = SO(odd) 

g = o. 


REMARK 1.3. We only list g and not the optimal test functions or their Fourier transforms 
above, as we do not need either function for the computation of the infimum. IIILSfl show that given 
the g associated to the optimal function, the infimum is given by 


(1.14) inf(g.o-) 


1 

\ a _ a g{x)dx' 


where the integral above exists and is nonzero by (11.171) and (12.41) . both established later. 


A natural choice for a test function is the Fourier pair 

this is the function used for the initial computation of average rank bounds in IIILSI and are optimal 
for o = l. For the groups SO(even), Sp, SO(odd), and for 1 < a < 1.5 the functions we find 
provide a significant improvement for the upper bounds on average rank over the pair (11.151) . We 
illustrate the improvement in Figure [3] which is much easier to process than (11.131) . 


1.4. Sketch of Proof. The first step in our proof is to note that it follows from the Paley- 
Wiener theorem and Ahiezer’s theorem that the admissible functions </>, with supp(©) c [—2cr, 2 o] 
satisfy 

(1.16) (j>(y) = (g*g)(y), 

where 


(1-17) g(y) = g(-y) supp(^) c= [-cr, a], g e L 2 [-a,a]; 


see Appendix A of HILSII . We will sometimes refer to an “optimal g ”. By this, we mean the g that 
satisfies (11.161) and (11.171) for the optimal 0 at a fixed level of support. 

The broad strategy of the proof of Theorem II .11 is to use an operator equation from IIILSll to 
show (non-constructively) that for all a e R + , there exists a unique optimal test function with 
supp(0) <= [—2a, 2a] that minimizes the functional 


(1.18) 


l“ a ,’K x )Wg(x)dx 

m 


We then find a collection of necessary conditions that leave us with precisely one choice for f. 
More explicitly, our argument proceeds as follows. 

(1) We show that certain optimality criterion on f presented in IIILSI holds for all a e R + 
(here supp(</>) c= [—2a, 2a]). 

(2) We show that g is smooth almost everywhere, where g* g = and supp(g) <= (—a, a). 
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0 . 8 ! 


0 . 8 ( 


0 . 7 ! 


0 . 7 ( 


0 . 6 ! 




1.1 


1.2 


1.3 


1.4 


1.5 


Figure 3. Comparison of upper bounds. The larger bound is from using the sub- 
optimal naive guess (11.151) . the lower is from using our results from (11.131) . Left: Q 
= SO(even). Middle: Q = Sp. Right: Q = SO(odd). 

(3) Our kernels give us a series of location-specific integral equations. Using the previous 
smoothness result, we convert those to a system of location-specific delay differential 
equations. 

(4) We solve this system to find an n,-parameter family in which our solution lives. To find 
this solution, we incorporate symmetries of g - namely that it is even. 

(5) Incorporating more necessary conditions on g, we reduce the family to a single candidate 
function - by our existence result this is our g, from which we obtain our optimal test 
function 0. 

From the list above, we will accomplish goal Q] in §[2l goal [2] in §[3l goal [3] in ^4.11 goal 0] in 
^4.21 and goal El in ^5] The proof of the optimal functions for Q = O is significantly easier than 
the proofs for the other functions. We include a brief proof of this fact at the end of CD Finally, 
we conclude with some remarks about how these results are used in number theory, and discuss 
ongoing and future research. 

2. Extension of the Conditions of I1ILSH 

Our first step is to state and extend an optimality criterion on gg, analogous to that in Appendix 
A of HILS1 (we will state it in (12.41) 1. Following their arguments, we seek to minimize the functional 


(2.1) R(g) 


<(I + K g )g,g > 


\(9, 1>| 2 
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where I is the identity operator, 


(2.2) Kg a g(x) 



y)g(y)dy, 


and 

m(SO(even))(£) 

m(SO(odd))(0 
m(Sp)(0 
(2.3) m(O)(0 


^[- 1 , 1 ] (0 

1 - ^[-1,1] (0 

(0 

1 

2 


where I[-i,i] is the indicator function for the interval [—1,1]. 

Lemma 2.1. The operator Kg a is compact for all o e M + , and all choices ofQ. 

Proof. As the functions in (12.31) are all clearly in L 2 ([— cr, a]), they are trace class and there¬ 
fore compact. □ 

It follows that the operator (I + I\g rT ) satisfies the Fredholm alternative for all Q and all a e R + . 
Applying the arguments from llILSl shows that for all a the operator / + Kg jCr is still positive 
definite. Thus there is some g such that 


(2.4) (I + Kg^g) = 1. 

Again, following the arguments of llILSll . one can show that this g indeed minimizes (12.11) . This 
completes the first step. 

We are now ready to find the optimal functions for Q = (). 


LEMMA 2.2. For Q = O, and for any o e M + , the optimal test function for the minimization of 

(11.181) is 


(2.5) <f>{x) = 


f sin(2a7rx) 


\ (1 + cr)7TX / 

and the associated upper bound on average rank is 


( 2 . 6 ) 


1 1 
2^ + 2 


Proof - Optimal functions for Q = O. Using the criterion (12.41) . we can find the opti¬ 
mal functions for Q = O for all o e M + . Trying constant functions, with Kg = 41[_ CT ^j , we see 
that 


f-L- 

M 


) l+cr 


1° 

M 

> 


satisfies (12.41) . By (11.161) and (11.171) . we know f = 9*9- Thus f = (J 7-1 ^)) 2 , where IF 1 
denotes Fourier inversion. Two quick calculations yield (12.51) and (12.61) . □ 
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3. Smoothness Almost Everywhere 

We now show that for an optimal f such that <j> = g * g, g must be Lipschitz continuous. Then 
we show that such a function is differentiable almost everywhere, using a theorem of Rademacher. 
First, we show that g is bounded. 

LEMMA 3.1. Let f Fourier transform of the optimal function, supported in [—2 cr, 2a], then g 
(in the sense of (11.161) and (11.171) 1 is bounded. 


Proof. We show that 


(3.1) h(x) := mg(x — y)g(y)dy 

J —cr 

is bounded. We know that g e L 2 ([—cr, cr]). By the Cauchy-Schwarz inequality, we have 


(3.2) 


r 


m g (x - y)g(y)dy 


< 


L 2 \\m g (x - y)\\ L2 < GO, 


and thus h is bounded. 

By (12.41) . we know that for the optimal g we have g + h = 1. As h is bounded the optimal g 
must bounded. □ 


PROPOSITION 3.2. For any a e M + , the optimal g is Lipschitz continuous. 


PROOF. Using (12.41) and applying the maximum modulus inequality, we see that for aq, aq e 

[-o', <r]. 


(3.3) 


\g(xi) - g(x 2 )\ = (m g (x l -y)-m g (x 2 -y))g(y)dy 

J —cr 

< Irngixi - y) - m g (x 2 - y)\\g(y)\dy 

J—cr 

\g(y)\ \mg{x! -y)- m g (x 2 - y)\dy. 

J—a 


U max 

ye[-a,a] 


We now analyze (13.31) . Notice that for all choices of mg in (12.31) . the integrand is bounded by 
1/2. We will examine the region of integration. Without loss of generality we may assume x\ 5= x 2 . 
Note that our integrand vanishes everywhere except from max {aq - 1, x 2 + 1} to min {aq + 1, a} 
and again from max{—cr, aq — 1} to min{aq — l,aq + 1}- Thus the region of integration has 
measure at most min {2(aq — aq), 4}, and the integrand vanishes outside of a set of measure at 
most 2(aq — aq). 

We may now revise the inequality in (13.31) : 


\d{x\) ~ g{x 2 )\ < max |y(y)| \m g (x 1 - y) - m g (x 2 - y)\dy 

ye[-cr,cr] J_ a 


(3.4) 


< max |y(y)|(2|xi - x 2 \) 

yE[-a,a\ \ Z 

^ max \g(y)\\xi — aq|, 

ye[-cr,a\ 


completing the proof that g(x) is a Lipschitz continuous function. 
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We use a theorem of Rademacher to show that our function g is differentiable almost every¬ 
where. 

Theorem 3.3 (Rademacher (see Theorem 3.1.6 of liFedil )). Let Q c be open. If f : fi —> 

M is Lipschitz continuous, then f is differentiable almost everywhere in Q. 

We immediately obtain the following. 

Corollary 3.4. For all o s M + , the optimal g is differentiable almost everywhere. 

Proof. Let g(x) be g{x) restricted to (—a, o). The result for g(x) follows from Proposition 
13.21 and Theorem 13.31 Thus g is differentiable almost everywhere in (—a, a), which is almost 
everywhere in [—a, a]. □ 

Finally, we show that each such g is in fact infinitely differentiable almost everywhere. 

Lemma 3.5. The optimal g is infinitely differentiable almost everywhere. 

PROOF. We proceed by induction. Our base case, that g is once-differentiable, is established 
by Corollary 13 .41 For the inductive step, we assume that g is A; -times differentiable. 

Note that for any choice of m, we have 

/ rb r/i(z,<r) pLO.O \ 

(3.5) g(x) = 1 - I a 1>g g(y)dy + a 2 , g g(y)dy + a 3 ,e g{y)dy , 

\ Ja Jci Jc2 J 

where the ct* are scalars and f,{x, o) is either a constant or a smooth function of x. In particular, 
for o > 1 at least one of the f t is a smooth function of x. We know that g is continuous. By the 
fundamental theorem of calculus and the chain rule, the expression on the righthand side of (13.51) 
is k + 1 times differentiable. This completes the proof of the inductive step, and thus g is smooth 
almost everywhere. □ 


4. A System of Integral Equations 

To establish our integral equations, we first show that the optimal g is even. 

Lemma 4.1. The optimal g is even. 

Proof. The key to this proof is that any choice of m from (12.31) is even. We show that g(—x) 
also satisfies (12.41) and so must be equal to g. As m is even, we have 


g(-x) + 



y)g(-y)dy 


(4.1) 



m(x + y)g(y)dy 
m(-x - y)g(y)dy 


(/ + Kg iff )(g)(-x), 


which is equal to one. By uniqueness, g(x) = g(—x) and g is even. 


□ 


By the results of the previous section, finding the optimal f for 2 < 2cr < 3 involves finding 
the optimal g for 1 < o < 1.5. We claim, (momentarily without justification) that there are three 

it 













intervals of importance in our study of this function. These are 


J\ ■= [ 0 , o — 1 ] 

J 2 := [o' — 1, 2 — cr\ 
(4.2) J 3 := [2-a, a]. 


As g is even, it suffices to find g on [0, cr], which means finding g on all of the intervals above. 
Examining the kernels in (12.31) and the requirement (12.41) . we see that for x e J 1; the optimal g 
satisfies 

rx+l rl—x rcr 

(4.3) g(x) + a 2j g\ g(y)dy + a 2) g \ g(y)dy + a ljG \ g(y)dy = 1, 

Jo Jo Jo 


and for x e J 2 or J 3 , we have 
(4.4) g(x) + a 2 


rcr rcr 

,g g(y)dy + a 1>g g{y)dy 

Jx-l Jo 


1 . 


In equations (14.31) and (14.41) . we note that a liG = 0 for Q 4= SO (odd) and 1 for Q = SO (odd) and 
a> 2 ,g = 1/2 for Q = SO(even) and —1/2 for Q = Sp or Q = SO(odd). 


4.1. Conversion to Location-Specific System of Delay Differential Equations. Lemma [331 
justifies differentiation of (14.31) and (14.41) under the integral signs, which gives the following system 
of location-specific delay differential equations: 

(4.5) g'{x) + OL 2)Q g{x + 1) - a 2 ,gg(l - x) = 0 

(4.6) g\x + 1) - a 2 ,gg(x) = 0, 

where (14.51) holds for x e ,/), and (14.61) holds for x + 1 e J 2 or J 3 . 


4.2. Solving The System. 

Lemma 4.2. The optimal g satisfies 

Jc t cos (^)+Casing) ifxEj, 

(4.7) g(x) = < v 7 / _ n v 7 / \ 

I Cia 2) g V2 sin ( ^ J - c 2 a 2 ,g\'2 cos ( J + c 3 if x e J 3 

/or some Ci e R. 

Before proving this lemma, it is important to note the following symmetry among our intervals. 
We first set some notation. If a is a number and / is an interval, 

(4.8) a — / := {x : x = a — y,y e I}. 

Note that Jj is one of the intervals defined in 

(4.9) 1-Jr Q J 3 . 

We also mention that 

(4.10) 1 — J 2 = J 2 , 

though we will not use this fact until later. 
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Proof. Differentiating (14.31) yields 

(4.11) g"(x) + a 2: gg'(x + 1) + a 2 ,gg'{l — x) = 0. 

Because of the symmetry (14.91) . we may use equation (14.41) on both the x + 1 and 1 — x terms. This 
gives us the following equation: 

(4.12) g"(x) + oi\g(g(x) + g{-x)) = 0. 

By Lemma |4~TL g is even. Moreover, a 2 ,g = +1/2. So, for any group Q, (14.121) simplifies to 

(4.13) g"(x) + ^g(x) = 0, 

which is a standard differential equation and easily solved. Its solution, which applies to g on Ji, 
is ci cos(x/v2) + c 2 sin(x/v2) for some constants c 1; c 2 . We find the three parameter family for g 
on J 3 by applying (14.61) to this result. □ 


Note that because of the symmetry (14.101) . the associated delay differential equation on interval 
two is different. It is 


(4.14) g'(x) - a 2) gg(l - x) = 0. 

LEMMA 4.3. The delay differential equation (14.141) has a unique one-parameter family of so¬ 
lutions in the class C 2 . That family is 

7r + 2 a 2 ,g' 


(4.15) C\ cos ^ a 2) gx — 
with ci e M. 


Proof. Differentiate (14.141) to obtain 

(4.16) fg(x) = -a 2t gfg( 1-x) = -a\gfg[x) = ~fg(x), 

where we obtain the second equality by applying (14.141) to fg(l — x). The third equality is simply 
a subsitution for a 2: g. However, equation (14.161) is a standard linear differential equation that has a 
two-parameter family of solutions given by 

(4.17) ci cos (0 + c 2 sin (|) . 

We now apply (14.141) to narrow this family down to a one-parameter family. The differential 
equation (14.141) and trigonometric angle addition formulae yield the relation 

i(- ClS m(|) +C2 co S (|)) - ^(^cosQj+^sinQ^cosd) 


(4.18) 


+ 


a 2 ,g 


^Ci sin 





In order for the expression above to vanish, we need the coefficients on cos(.iy / 2) and sin(x/2) to 

V 


both be zero. This translates into the requirement that the vector 
matrix 

2a 2) g cos(l/2) 2a 2t g sin(l/2) — 1 


c 2 


be in the nullspace of the 


(4.19) 


2a 2) g sin(l/2) + 1 — 2a 2> g cos(l/2) 
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Note the matrix in (14.191) has determinant 
(4.20) -4c^ (sin 2 (1/2) + cos 2 (1/2)) + 1 = 0 

because a 2) g = +1/2. We know from HILSH that for each Q the function in (14.151) is a solution to 
(14.141) . From our determinant argument, we know all solutions to that differential equation are all 
scalar multiples of a single nonzero solution, completing the proof. □ 


5. Finding Coefficients 

Substituting values for a t ,g for * = 1, 2, we find 

c i+ cos (vl) 


+ c 2)e sinl M 


|x| + a — l 
a — 1 + Ixl + 2 — a 


(5.1) //SO(cvcn) (C ) '^SO(even) < COS ^2 4 ^ 

.w si "(+)-w <:os (+) + C3 

for Q = SO (even) and 

( / |_| \ / |_| \ 

\x\ + a — l 
a — 1 ^ |x| ^ 2 — a 
2 — a < Ixl ^ a 


ci,acos|Mj +C2eSin (N 


(5.2) g s (x) = Xg < cos 
for Q = SO(odd) or Sp. 


l+04+f)+^«> s (+fi+<3 


LEMMA 5.1. There exist unique, computable coefficients c t ,g. Xg (for i = 1.2,3) so that the 
functions (15.11) and (15.21) satisfy (/ + Kg,a) = 1 and are thus optimal. 


PROOF. We use (12.41) and Lemma 13.51 to find more necessary conditions on such a gg. In 
particular, we impose the three relations: 

(5.3) lim gi,g{x) = lim gg(x) 

x—*-(cr—l)~ £—>(( 7 — 1 ) + 

(5.4) (I + K g )(g s m - (I + K g )(g g )(.5) 

(5.5) (/ + Kg)(g g )(0) = (I + Kg)(gg)(a). 

The first gives continuity, the second and third ensure that (/ + I\g, a ) is constant; however, they 
do not ensure it is 1. That is accomplished by using Xg to appropriately scale down the function. 
This gives us the matrix equations 

/ 


(5.6) 



sin 


( 7—1 

+2 


\75 sin (^) +cos ( 


( 7—1 


0 

V2- ^cos 


0 \ 

0 

- 1 / 


92{<7 - 1 ) 

32(0- -1) 
0/2(0- - 1) - 32(2 


o- 


0(even 

and 



/ 

cos fe) 

(+) 

0 ^ 


cos ( vTj 

0 

0 

V+I sin ( 

+r) +cos (+) 

-v ^ + 75 cos ( vt) 

-v 


32(0- -1) 

32(0- -1) 

,32(0- - 1) - 52(2 - a) 
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(5.7) 


























for Q = SO(odd) or Sp. Here, g 2 is gg restricted to J 2 . 


(5.8) 


Expanding these matrices along the their third columns, we see that 

a — 1 


| ^SO (even) | | ^4sO(odd)/Sp | COS ^ y— ^ 


Sill 


V2 


which are both nonzero for 1 < a < 1.5. Solving the matrix equations, we obtain 

a — l' 


(5.9) 

and 


Cl,SO(even) 

C2,SO(even) 

C3,SO(even) 


O - 1 1 , 1 , , 

cos | —--h —1 — 7T) | sec 


V2 


= 0 


sm | -(2 a + 3n — 3) ) + 


sin (|(—2 a + 371 + 3)) tan 


V2 


Ci t g = COS 

c 2 ,g = 0 


1 — a 


+ 


7T 


sec 


a 


V2 


(5.10) 


c 3: g = sm | -(—2cr + 37T + 3) ) — 


sin Q(2 a + 3-7T — 3)) tan 

V2 


for Q = SO(odd) or Sp. 

We currently have (/ + Kg )0 )(gg) = c for some constant c. Here, gg is the unsealed optimal 
function. As some of our c t ,g are nonzero and the operator (/ + Kg, a) is positive definite, this 
constant is nonzero and it can therefore be scaled to be one. We find the correct scaling factor 
by computing ((/ + Kg,<r)(gg)( 0)) _1 , setting that equal to A g in (15.11) and (15.21) . From these 
computations, we find 


^ g,<r — 9g( 0 ) + 


= 2\/2 


sm 


(5.11) 

for Q = SO (even), and 


+ 2 Sm 




gg(y)dy 

(3 — 2cr) ) + (cr — 1) sin ( -(— 2a + n + 3) 


s — 1 


-(2cr + 7 r — 3)^ ^vO(s + 1) tan ^ 


+ 2 


9g(y)dy 


-2V2sin ^-(3 — 2cr)^ + (cr — 1) cos ^-(2cr + 3-7T — 3) 


+ 2 sin 


-(—2a + 7T + 3) ) ( V2(cr — 3) tan 


a 
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V2 


+ 2 


(5.12) 















for Q = Sp, and 


\q 


A Sp + 



= A S p + 4 (a - 1) sin ( 

(5.13) —2v / 2(cr — 2) sin 

for Q = SO (odd), completing the proof. 


-(2 a + 7r — 3)^ + 4-\/2sin ^-(3 — 2<x)^ 


-(—2a + 7T + 3) ] tan 


a — 1 
^/2~ 


□ 


6. Conclusion and Future Work 

In 0 we include similar calculations for 3 < a < 4. We conjecture that these methods will 
provide solutions for all a e M + . In this pursuit, there are two important steps. The first is solving 
the system of delay differential equations. This gives a family of solutions. The second is taking 
the output of that system and finding the correct system of equations that give us the coefficients 
to pick our optimal g out of that family. Preliminary calculations suggest the second step will be 
more difficult than the first; however, even solving the first problem in general, or providing an 
algorithmic approach, is important progress as it reduces the problem to an optimization over a 
finite-dimensional space, as opposed to an infinite dimensional one. 

While currently the best results (assuming no more than GRH) for showing agreement between 
number theory and random matrix theory for the 1-level density are only for supp(0) c (—2, 2), in 
some cases larger support is attainable through additional assumptions. One example is the slight 
improvement for cuspidal newforms in HILSII under their Hypothesis S. Another are families of 
Dirichlet characters, where Fiorilli-Miller llFiMtt improve up to (—4,4) under some weak assump¬ 
tions about the distribution of primes in residue classes (with stronger ones arbitrarily large support 
is attained). Thus there are already situations where we can gainfully employ these new optimal 
test functions in these expanded regimes. 

Additionally in 0 we hope to generalize these arguments to the n-lcvel densities, and then 
either there or in future work examine the slight modifications needed in the optimal function if 
we have lower order terms. 
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